Masters

Project Memory: PrERT-CNM

Identity & Role

AI-Driven Privacy Risk Quantification Engine. Unifies transformer-based policy extraction (PrivacyBERT) with probabilistic risk modeling (Bayesian Networks). Aligns unstructured legal text against international standards (ISO/IEC, NIST, GDPR) using quantifiable indicators.

Current Project State

The project architecture and Month 1 sprint map are actively being scaffolded.

Activity Requirements: Month 1 Deep Analysis

Objective: Map measurable privacy principles from ISO/IEC, NIST, GDPR, IEEE and international data protection regulations into privacy indicators.

Analysis of Current State vs. Actual Requirements: Our initial execution scaffolded a basic JSON representation mapping abstract GDPR concepts (e.g., Article 5) to placeholder indicators. However, reviewing docs/Architecture-Stack.md and docs/Model.md, this is entirely insufficient. The final PrERT-CNM (Privacy BERT Contextual Neural Memory) model requires a Hierarchical Multi-Label Classification system (Top Level: High-level ISO domain; Bottom Level: Fine-grained requirements like Encryption Standards).

Therefore, Month 1's true deliverable must structurally align with this 2-stage model design.

Deep Work Breakdown for Month 1:

  1. Taxonomy Structuring (Top vs. Bottom Level): We must define universally applicable Top-Level Categories (e.g., Access Control, Data Retention). Within these, we define the Bottom-Level Attributes (e.g., Password Length, Encryption Standards).
  2. Cross-Framework Overlays (Universal Schema): The mapping must unify ISO/IEC, NIST, GDPR, and IEEE under communal privacy indicators. A single fine-grained attribute must link to its specific clause in GDPR (Art 32) and NIST AI RMF simultaneously.
  3. Measurability & Scoring Bounds: Indicators must be inherently quantifiable (e.g., Boolean existence flags or probability distributions) to properly parameterize the Bayesian Risk engine.
  4. Data Structure Overhaul (config/privacy_indicators.json): The current JSON loader expects a flat list of indicators per specific framework principle. This must be entirely refactored into a hierarchical knowledge graph that the CNM (Contextual Neural Memory) can traverse to apply specialized, fine-grained context rules.

Pivot Strategy: The config module mapping logic must be completely rewritten. We need to draft a comprehensive schema reflecting the Multi-Label Hierarchical requirements before moving on to Month 2.

Active Tasks

Next Steps

Architectural Decisions